[Oneshot] Oneshot Refactor #1041

horheynm · 2025-01-07T15:07:27Z

SUMMARY:
Edit oneshot pathway to use only the necessary code.

Problem:
Oneshot uses a pathway that is shared in other entrypoints, eg. train.
Oneshot doesnt train, so do not need use code that trains, as an example of redundant code. Remove other parts of the code and refactor

Design

Changes:

Oneshot class responsible for carrying out the oneshot pipeline
Oneshot run will not depend on the session. Only use CompressionLifecycle that has modifer logic support. Places that uses active_session, such as saving to the compressed model by reading the stagemodifiers, has been replaced by passing in the necessary args

TEST PLAN:
Pass existing passing tests
Pass all finetune tests merging transformer main for HFQuantizer support
Verified new pathway has the same scales for the int8 w8a8 dynamic per token values for Meta-Llama-3-8B-Instruct

github-actions · 2025-01-07T15:07:40Z

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

…sor into oneshot-refac-1

dsikka

It seems like we're introducing more abstractions, not fewer, when separating out the oneshot entrpoint and I'm not sure these are necessary.

we also seem to be repeating a lot of code everywhere.

examples/quantization_w8a8_int8/llama3_example.py

examples/quantization_w8a8_fp8/llama3_example.py

src/llmcompressor/transformers/finetune/session_mixin.py

src/llmcompressor/transformers/finetune/text_generation.py

src/llmcompressor/transformers/finetune/model_args.py

src/llmcompressor/transformers/finetune/trainer.py

src/llmcompressor/transformers/calibration/oneshot.py

dsikka · 2025-01-08T18:37:48Z

SUMMARY: Edit oneshot pathway to use only the necessary code.
Problem: Oneshot uses a pathway that is shared in other entrypoints, eg. train. Oneshot doesnt train, so do not need use code that trains, as an example of redundant code. Remove other parts of the code and refactor

Changes:

Oneshot run will not depend on the session. Only use CompressionLifecycle that has modifer logic support.

Return model as a return type to oneshot to obtain model without relying on session.

TEST PLAN: Pass existing passing tests Verified new pathway has the same scales for the int8 w8a8 dynamic per token values for Meta-Llama-3-8B-Instruct

Please update the PR desription showing what the updated entrypoint looks like as a result of this PR.

…sor into oneshot-refac-1

horheynm · 2025-01-09T13:55:27Z

@dsikka Thanks for looking at the pr early. Yes thats the next to do before the full review. One ready i will ping you!

…gs and popualte model_args to avoid collision

…sor into oneshot-refac-1

…une.py

horheynm · 2025-01-15T22:30:56Z

tests/llmcompressor/transformers/gptq/test_oneshot.py

@@ -75,7 +75,6 @@ def test_oneshot_application(self):
            model=self.model,
            dataset=self.dataset,
            output_dir=self.output,
-            overwrite_output_dir=True,


Only used in HF training args
https://github.com/huggingface/transformers/blob/main/src/transformers/training_args.py#L819

horheynm added 2 commits January 7, 2025 08:53

init

276b779

decouple main and successful fp8 run

c690043

horheynm added 6 commits January 7, 2025 14:56

remove stage runner

166e4df

run calib

40c73eb

Merge branch 'main' into oneshot-refac-1

7747bd6

potential non use of session

3b7fd6a

Merge branch 'oneshot-refac-1' of github.com:vllm-project/llm-compres…

b3031c0

…sor into oneshot-refac-1

get rid of session, use oneshotclass

1cd3d90

dsikka requested changes Jan 8, 2025

View reviewed changes

horheynm added 6 commits January 8, 2025 13:39

pass existing tests

a5d0fd7

Merge branch 'main' into oneshot-refac-1

33e1b16

pass finetune tests not dep on HF release

e7407b9

Merge branch 'oneshot-refac-1' of github.com:vllm-project/llm-compres…

d352e4c

…sor into oneshot-refac-1

remove unnecessary changes 1

bc532e7

remove duplicate code

137c02e

horheynm added 3 commits January 9, 2025 15:56

remove duplicate code, set output_dir and save_tensors as training_ar…

6d5cdbc

…gs and popualte model_args to avoid collision

pass tests pre HFQuantizer check

2c7c5f0

lint

324fc99

horheynm changed the title ~~[Draft] Oneshot entrypoint refactor~~ [Oneshot] Oneshot Refactor Jan 10, 2025

horheynm added 3 commits January 9, 2025 23:50

oneshot

0e34ad3

add __all__

9a6a87f

add init

54e8fd0

horheynm mentioned this pull request Jan 10, 2025

[Test Run] Oneshot refactor + hfquantizer #1055

Draft

horheynm added 4 commits January 14, 2025 11:22

Merge branch 'main' into oneshot-refac-1

01eff29

move private below non-prov

b20d6b8

Merge branch 'oneshot-refac-1' of github.com:vllm-project/llm-compres…

7e84319

…sor into oneshot-refac-1

pass tests/llmcompressor/transformers/finetune/test_oneshot_and_finet…

3547baf

…une.py

remove redundant code

976814f

horheynm commented Jan 15, 2025

View reviewed changes

horheynm added 10 commits January 15, 2025 17:36

remove training_args, use session not local lifecycle

59d5d63

move args

b5f75d5

simplify inputargs to oneshot

bd1385e

clean up **kwargs of Oneshot

d52dbf3

better doc strings

0060b63

add docstrings, retire apply

9eaf4c2

revert exampels script

77d15a4

remove apply from sessionmixin:

d5d34f6

remove comments

73e4d7b

Merge branch 'main' into oneshot-refac-1

e1bdffd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Oneshot] Oneshot Refactor #1041

[Oneshot] Oneshot Refactor #1041

horheynm commented Jan 7, 2025 •

edited

Loading

github-actions bot commented Jan 7, 2025

dsikka left a comment

dsikka commented Jan 8, 2025

horheynm commented Jan 9, 2025

horheynm Jan 15, 2025

[Oneshot] Oneshot Refactor #1041

Are you sure you want to change the base?

[Oneshot] Oneshot Refactor #1041

Conversation

horheynm commented Jan 7, 2025 • edited Loading

github-actions bot commented Jan 7, 2025

dsikka left a comment

Choose a reason for hiding this comment

dsikka commented Jan 8, 2025

horheynm commented Jan 9, 2025

horheynm Jan 15, 2025

Choose a reason for hiding this comment

horheynm commented Jan 7, 2025 •

edited

Loading